Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????v??????????vB 0011111100111111001111110011111100111111001111110011111100111111001111110011111101110110001111110011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f3f7642
SJIS-WIN 杖??坎??趾?珥?v杖??坎??趾?珥?vB 10001111111100010011111100111111100110101010101000111111001111111110011011100100001111111110000011100000001111110111011010001111111100010011111100111111100110101010101000111111001111111110011011100100001111111110000011100000001111110111011001000010 8ff13f3f9aaa3f3fe6e43fe0e03f768ff13f3f9aaa3f3fe6e43fe0e03f7642
EUC-JP 杖?檉坎??趾?珥?v杖?檉坎??趾?珥?vB 1011111011110011001111111000111111000101101110111101010010101100001111110011111111101100111001100011111111100000111000100011111101110110101111101111001100111111100011111100010110111011110101001010110000111111001111111110110011100110001111111110000011100010001111110111011001000010 bef33f8fc5bbd4ac3f3fece63fe0e23f76bef33f8fc5bbd4ac3f3fece63fe0e23f7642
UTF-8 杖렚檉坎렫렏趾렗珥텡v杖렚檉坎렫렏趾렗珥텡vB 111001101001110110010110111010111010000010011010111001101010101010001001111001011001110110001110111010111010000010101011111010111010000010001111111010001011011010111110111010111010000010010111111001111000111110100101111011011000010110100001011101101110011010011101100101101110101110100000100110101110011010101010100010011110010110011101100011101110101110100000101010111110101110100000100011111110100010110110101111101110101110100000100101111110011110001111101001011110110110000101101000010111011001000010 e69d96eba09ae6aa89e59d8eeba0abeba08fe8b6beeba097e78fa5ed85a176e69d96eba09ae6aa89e59d8eeba0abeba08fe8b6beeba097e78fa5ed85a17642
UHC 杖렚檉坎렫렏趾렗珥텡v杖렚檉坎렫렏趾렗珥텡vB 11101101111010001000111010101101111011111110000011001010111011001000111010111001100011101010010111110010101111111000111010101100111011001011010011000101110111100111011011101101111010001000111010101101111011111110000011001010111011001000111010111001100011101010010111110010101111111000111010101100111011001011010011000101110111100111011001000010 ede88eadefe0caec8eb98ea5f2bf8eacecb4c5de76ede88eadefe0caec8eb98ea5f2bf8eacecb4c5de7642

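SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (byte 3f) in the table above.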
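
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the helper name `encode_table` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration. Unencodable characters are replaced with "?" (byte 3f), which matches the behavior seen in the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the codec lacks.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


for label, bits, hex_string in encode_table("vB"):
    print(label, bits, hex_string)
```

For the ASCII-only input "vB", every charset listed yields the same two bytes, 76 and 42, which is why each row of the table ends in 7642.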