To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??猷??松??閻?【???誘り?魚 111000011001111100111111001111111001011101010001001111110011111110001111101111000011111100111111111010001000010100111111100000010111100100111111001111110011111110010111010101011000001011101000001111111000101110011011 e19f3f3f97513f3f8fbc3f3fe8853f81793f3f3f975582e83f8b9b
EUC-JP 癲??猷??松??閻?【???誘り?魚 111000101010000100111111001111111100110110110010001111110011111110111110101111100011111100111111111011111110010100111111101000011101101000111111001111110011111111001101101101101010010011101010001111111011010111111011 e2a13f3fcdb23f3fbebe3f3fefe53fa1da3f3f3fcdb6a4ea3fb5fb
UTF-8 癲숆낄猷쀩걬松쎌춳閻롮【溜띶짃誘り섬魚 111001111001100110110010111011001000100010000110111010111000001010000100111001111000110010110111111011001000000010101001111010101011000110101100111001101001110110111110111011001000111010001100111011001011011010110011111010011001011010111011111010111010000110101110111000111000000010010000111011111010011110001011111010111001110110110110111011001010011110000011111010001010101010011000111000111000001010001010111011001000010010101100111010011010110110011010 e799b2ec8886eb8284e78cb7ec80a9eab1ace69dbeec8e8cecb6b3e996bbeba1aee38090efa78beb9db6eca783e8aa98e3828aec84ace9ad9a
UHC 癲숆낄猷쀩걬松쎌춳閻롮【溜띶짃誘り섬魚 1110111110100110100110011110101010110011101001011110101110100011100101111110100110000001100101011110000111100110101111011110110010101101100011111110011110100010100011101110110010100001101111001110101011111110100011011110010110100011100100111110101110101111101010101110101010111100101101101110010111100000 efa699eab3a5eba397e98195e1e6bdecad8fe7a28eeca1bceafe8de5a393ebafaaeabcb6e5e0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)