What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 夜??揖??幽??菴??侑??儒??癲≪?? 100101101110100100111111001111111001011101001011001111110011111110010111010010000011111100111111111001001011110100111111001111111001100011010000001111110011111110001110111100100011111100111111111000011001111110000001111000010011111100111111 96e93f3f974b3f3f97483f3fe4bd3f3f98d03f3f8ef23f3fe19f81e13f3f
EUC-JP 夜??揖??幽??菴??侑??儒??癲≪?? 110011001110101100111111001111111100110110101100001111110011111111001101101010010011111100111111111010001011111100111111001111111101000011010010001111110011111110111100111101000011111100111111111000101010000110100010111000110011111100111111 cceb3f3fcdac3f3fcda93f3fe8bf3f3fd0d23f3fbcf43f3fe2a1a2e33f3f
UTF-8 夜껊씛揖졿뿿幽뚯뒓菴뀀맕侑쇤뇳儒섏탮癲≪궚柳 111001011010010010011100111010101011101110001010111011001001010010011011111001101000111110010110111011001010000110111111111010111011111110111111111001011011100110111101111010111001101010101111111010111001001010010011111010001000111110110100111010111000000010000000111010111010011110010101111001001011111010010001111011001000011110100100111010111000011110110011111001011000010010010010111011001000010010001111111011011000001110101110111001111001100110110010111000101000100110101010111010101011011010011010111011111010011110001001 e5a49ceabb8aec949be68f96eca1bfebbfbfe5b9bdeb9aafeb9293e88fb4eb8080eba795e4be91ec87a4eb87b3e58492ec848fed83aee799b2e289aaeab69aefa789
UHC 夜껊씛揖졿뿿幽뚯뒓菴뀀맕侑쇤뇳儒섏탮癲≪궚柳 1110010110101000100000111110101110011101101100001110101111100111101000001110011010010111101111111110101011101011100011001110110010001010100100001110010011100000101100101110101110010000101001111110101011100010101111001110100110000111100101111110101011100011100110001110110010110101100011101110111110100110101000011110110010000010101011111110101011110111 e5a883eb9db0ebe7a0e697bfeaeb8cec8a90e4e0b2eb90a7eae2bce98797eae398ecb58eefa6a1ec82afeaf7

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
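
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name encode_table is hypothetical, and the codec names are Python's, assumed here to correspond to the charsets above (cp932 for SJIS-Win, cp949 for UHC). Characters that a charset cannot represent are replaced with "?" (byte 3f), which is consistent with the ISO-8859-1 row above.

```python
# Sketch: encode a string in several charsets and show the resulting
# bytes as a binary bit string and as hexadecimal.
# Assumption: Python codec names stand in for the page's charset names
# (cp932 ~ SJIS-Win, cp949 ~ UHC).

CHARSETS = ("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")


def encode_table(text, charsets=CHARSETS):
    """Return a list of (charset, binary_string, hex_string) tuples."""
    rows = []
    for charset in charsets:
        # errors="replace" substitutes "?" for unencodable characters
        data = text.encode(charset, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
        rows.append((charset, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for charset, bits, hex_string in encode_table("夜"):
        print(charset, bits, hex_string)
```

For the single character 夜, this yields e5a49c in UTF-8, 96e9 in cp932, and cceb in EUC-JP, matching the leading bytes of the corresponding rows in the table.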